Journal of Research of the National Bureau of Standards 



Vol. 51, No. 1, July 1953 



Research Paper 2429 



Eigenvectors of Matric Polynomials 

Murray Mannos 

It is the object of this paper to compare the eigenvectors of an arbitrary nXn matrix . 1 
over the complex field with those of the matric polynomial f(A). While it is well known 
that each eigenvector of A is an eigenvector of f(A), it is not, in general, true that A and 
f(A) have identical eigenvectors. In this regard a necessary and sufficient condition that 
A and f(A) have identical eigenvectors is given. The condition is that both (1) and (2) hold: 

(1) f{\) t±0 for all eigenvalues X of the matrix A corresponding to nonlinear elementary 
divisors. 

(2) The values of /(/*) are distinct for all eigenvalues fi of the matrix A corresponding 
to linear elementary divisors. 

When either (1) or (2) fails to hold, then f(A) has eigenvectors thai are not eigenvectors of 
A. This situation is also discussed. 



The vector space of eigenvectors of the matrix A 
corresponding to the eigenvalue X shall be denoted 
by 1 \[A]. The vector space spanned by the eigen- 
vectors of A shall be denoted by V[A\. Further, 
f/{T\|y4]} and <i{F[^l]} shall denote their dimensions. 
It is clear that each eigenvector of A is an eigen- 
vector of f{A). That is, V x [A]^V m lf(A)] for each 
eigenvalue X of A. Thus V[A] c V[f(A)]. 

Let J be the Jordan canonical form of A. Then 
there exists a nonsingular matrix P such that 
P~ l AP=J and so P~f{A)P=f{J). 

Lemma 1 . The eigenvectors of A and f(A) are iden- 
tical if, and only if, the eigenvectors of J and f (J) an 
identical . 

Proof. This follows from the fact that P£ is an 
eigenvector of PBP~ l if £ is an eigenvector of the 
matrix B. 

Since P{V\[J]} = V\[A], it follows that 
d{ VIM)} }~d{ V[A] } =d{ V\j{J)] } -d{ V[J] } , 
where P{V\[J]} denotes the space of all vectors P£, 
where £ is an eigenvector of J corresponding to X. 

Lemma 2. If D=diag [a,a,. . . ,a; (3,13, . . .,0; 
. . .; 7r,7r, . . .,7r], where a,/?, . . .,w are distinct, then 
V[D] = V[f(D)]. Furthermore, the eigenvectors oj D are 
identical with those of f(D) if, and only if, /(or), /(j8), 
. . .,/(7r) are all distinct. 

Proof. If D is of order n, it is clear that both the 
eigenvectors of D and/(Z>) each generate the whole 
n-dimensional vector space, If /(a)=/(/3), then it is 
easily seen that f(D) has eigenvectors corresponding 
(o the eigenvalue /(a)=/(|8), which are not eigen- 
vectors of D. 

Lemma 3. If 

"X 1 ... 0" 



JmW-- 



x l 












X 1 

o o xj 



i Preparation of this paper was sponsored (in part) by the Office of Scientific 
Research, USAF. 



tin n 



/(/«(X)) = 



/(X) /'(\) 
f(\) /'(A) 



/"(a) .r"(x) 



2! 3! 

f"M 
2! 







o /(x) r(x) 



/<--'> (X) ' 

(m-l)\ 

/<"»-*> (X) 
(m-2)! 

/(m-«(X) 

(w-3)! 



/(X) /'(X) 
/(A) J 



Proofsof the lemma may be round in Wedderburn 2 
and MacDuffee. 3 

Lemma 4. // f (X) =/" (Xj = . . =f r) (\) = Q and 
yc-M)(X)^0 (r=l,2, . . . \m-2), then 

d{ V m [f(JmO))] } ~d{ Vx[J m (X)] } =r, 
and von nisely. Also. 



d{V m [f(Jr,W)]}-d{VAJ m W}}-- 



■m- 



if, and only if, /' (X) =/" (X) = . . .=/-*(X)=0. 

Proof. If f(X)=/"(X)= . . . =f>(X)=0, but 
p+i)(X)^0, it follows that the rank of the matrix 
f(J?nW)—fWI is m— (r+1), as may be observed by 
noting the nonzero minor of order m— (r+1) in the 
upper right-hand corner of the matrix. Thus the 
nullity of /(«7»(X))— /(X)J is r+1. If the first ra-1 
derivatives of f(x) vanish at o,=X, then the nullity 
of f(J m (k))—f(\)I is m. In a similar fashion one 
finds that the matrix J m (\) — \I has nullity equal 
to 1. This establishes the lemma. 

Corollary. The matrices J m (\) and/(e/ m (X)) have 
identical eigenvectors if, and only if,/ / (X)^0. 

It is to be noted that in this case V fi \)[f(J m (X))] 

2 J. H. M. Wedderburn, Lectures on Matrices, (Am. Math. Soc, Colloquium 
Pub. vol. 17, New York, N. Y., 1934). 

3 C. C. MacDuffee, An Introduction to Abstract Algebra (John Wiley & Sons, 
New York, N. Y., 1940). 
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and T"x[e/ m (X)] are each spanned bv the single column 
vector [1,0, . . .,0] r . 

At this point it will be convenient to introduce the 
language of elementary divisors. Each block in the 
Jordan canonical form of a matrix A corresponds to 
an elementary divisor (x— \) m , and conversely. 
Each such block is called the hypercompanion matrix 
J m (\) of the polynomial (x—\) m . 

Suppose the nonlinear elementary divisors of 
the matrix A corresponding to the eigenvalue X 

are of the form (x-\)< ]) , (x-\) m ?\ . . ., (a— X)"^, 
where m^ (i=l,2, . . ., k\) are integers such that 
mj[ l) >ra x (2) >. . .y>m^^>l. Furthermore, suppose 

that (x— \) m i l) appears as an elementary divisor nl () 

fcx 

times. Let p\=^ J n£ i) denote the total number of 

t=\ 
nonlinear elementary divisors of A corresponding to 
the eigenvalue X. Denote by n\ the number of 
linear elementary divisors of A corresponding to the 
eigenvalue X. Further, denote by K\ the direct sum 
of the hypercompanion matrices of the elementary 
divisors corresponding to X. Set 

dx=d{V m [f(K x )]}-d{V\[K x ]}. 

Lemma 5. If the first m[ l) — l derivatives of fix) 

fcx 

vanish at x—\ then d\=y^,nl i) (ml** — 1) ; whereas, if 
the first r<^m^ — 1 derivatives of f(x) vanish at x = \, 

l-l ' " k\ 

but /<'+*> (\)^0, then 4=S<^+S^" ) W ) -1) ) 

i=l i=l 

where m" } is the largest of the integers m^ (i= 1 ,2,...,& x ) 
such that ml l) <r. 

Prooj. This follows from lemma 4, the fact that 
the nullity of K\—\I is the sum of the nullities of 
the characteristic matrices of the hypercompanion 
matrices of the individual elementary divisors corre- 
sponding to the eigenvalue X and a similar statement 
about the nullity of f(K x )-f(\) I. 

Corollary. The matrices K\ andf(K\) have identical 
eigenvectors if, and only if, /'(X)^0. 

Proof. This follows from the fact that d\=0. 

The Jordan canonical form J=dmg [K ai Kp,. . .,K n ] 
of A is a direct sum of matrices K\, where \=a,fi, . . . 
runs through the distinct eigenvalues of A. The 
subsequent theorems of this paper involve the fol- 
lowing main conditions: 

Condition 1. /'(X)^0 for all eigenvalues X of the 
matrix A corresponding to nonlinear elementary 
divisors. 

Condition 2. The values /(/*) are distinct for all 
eigenvalues ix of the matrix A corresponding to linear 
elementary divisors. 

The first theorem concerns the case where condi- 
tion 1 does not hold, whether condition 2 holds or not . 

Theorem 1 . If condition 1 does not hold, then 
d{V\j{A)]}-d{V[A]}=J±d h 

X 

where X varies through all distinct eigenvalues of A 
corresponding to nonlinear elementary divisors, and 



d\ is computed as in lemma 5. 
Proof. One notes first that 



and 
T'W)]= 



V[J] = V a [J] + V,[J] + . . .+V.[J], 



T 
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If f(a), /(jS), . . .,/(?r) are distinct, for a fixed eigen- 
value X, the nullity of J—\I is the same as the 
nullity of i£ x -X/, and the nullity of f(J)-f(\)I is 
the same as the nullity of f(K\)—f(\)I. It follows 
that 

d{V[ m \f(J)]}-d{Vx[J]}=dx. 

Summing over distinct eigenvalues, one obtains 

d{v[f(j)\}-d{v[j]}=z:d,. 



By the statement following the proof of lemma 1 it 
follows that 



d{VU(A)]}-d{Y[A]}: 



X 



If f( a )=f(p) = . , . =/( p ), then the nullity of 
f(J)—f(a)I minus the sum of the nullities of J—al, 
J—fil, . . ., and J— pi is d a -\-dp-{-. . .-\-d p and 

d{V f UKJ)]}-d{V a [J]}-d{V [J]}-. . .-d{Y p [J]} 

= d a + dfi+. . .+d p . 

Summing, one obtains in either case 

d{V{f(A)]}-d{V[A]}=T,di, 

X 

where X runs through all the distinct eigenvalues of 
A corresponding to nonlinear elementary divisors. 

The next theorem concerns the case in which 
condition 1 holds but condition 2 does not. 

Theorem 2 . If condition 1 holds, than V[A] = V\f(A)\ . 

In this case ^2d\=0, where X varies over all dis- 
x 
tinct eigenvalues corresponding to nonlinear elemen- 
tary divisors. 

If/(tt)==/(j8) = . . .=/(p), suppose thai 



d{V fia) \f(J)]}=d{V a [J]} 

+d{V fi [J]} + 



-d{V,[J]) 



Then as in lemma 2 the ^-dimensional space 
Va[</] + T /3[J] + . . . + 1 ; P [J] contains vectors that are 
not eigenvectors of J, but T\ (a) [/(</)] consists solely 
of eigenvectors of /(</) corresponding to f(a). 
Corollary. If f( a )=f(p) = . . ,=/(p), then 

d{ V f(a) [f(A)] } -d{ T \[A] } =24+ (px+*x), 

X 

where X runs over all eigenvalues among a, /?,..., p 
which correspond to nonlinear elementary divisors and 
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, 



where p\-\-n\ is the total number of elementary divisors 
(nonlinear and linear) corresponding to X. 

Finally, the next result covers the case in which 
conditions 1 and 2 both hold. 

Theorem :*. The matrices A andf(A) have identical 
eigenvectors if, and only if (1) /'(X) ^0 for all eigen- 
values X of the matrix A corresponding to nonlinear 
elementary divisors, and (2) the values f(n) are dis- 
tinct for all eigenvalues n of the matrix A corresponding 
to linear elementary divisors. 

Remark 1 . It may readily be seen that V\J] and 
V\f(J)\ each can be generated by a set of linearly 
independent vectors, each of which has a 1 in a single 
component and in all the remaining components. 
Thus if J=P~ l AP is the Jordan canonical form of 
A, than F|7(A)] can be generated by a subset of the 
column vectors of P. 

Remark 2. A simple application of the foregoing 
theory shows that if A is a 2X2 matrix, then A and 
f{A) have the same eigenvectors unless either (1) A 
is diagonable and has distinct eigenvalues a, for 
which f(a)=f(p), or (2) A is nondiagonable and 
f(x)=k(x)(x—a) 2 +c, where a is the eigenvalue of A, 
k(x) is an arbitrary polynomial, and c is an arbitrary 
const a n I . 

The following example 4 is given as an illustration: 
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andf(x)=x«+3 5 -x*-2 z +2. 



For P= 
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it may be verified that 
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J=P-K\P 



and that the Jordan canonical form of A is 
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where 
C K: l =/ 1 (i)=[i], 

"0 1 0" 
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, / 3 (0) = 
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/i(0) = f0], 



/v-, = / 2 (-l) 



[,: :i 



Since 



/(0)=/(l)=/(-l)=2, 
/'(0)=/'(-l) = 0, /' (1)5*0, 
J"(0)=0, 



it follows that 



/(/) = /•+/«- J 4 -/ 3 + 27 = 



"2 0* 

2 

2 

2 

2 

2 

2. 



It is readily seen that \\[J] is generated by 
[1,0,0,0,0,0,0] 7 ',* V Q [J] is generated by [0,l,0,0,0,0,0] r 
and [0,0,0,0, \,0,()] T , and V_x[J] is generated by 
[0,0,0,0,0,1, or Since F[J] = F 1 [e/] + F [e7] + F_ 1 [J], 
it follows that d{ V[J] } =4. Since f[J] is a scalar ma- 
trix, d{ V[f( J)]} = 7. Hence d{\f(A)]}-d{V[A]} = 3. 
The same result is arrived at by the use of theorem 1 , 
where y^.dx is calculated as in lemma 5. Since 

/'(0) = 0,/"(0)=0, and/'(-l) = 0, where 0, -1 are 
the eigenvalues corresponding to nonlinear elemen- 
tary divisors, it follows that 



and 



d =n^(m^-l) = 1-2=2 
rf_ 1 =^ 1 (1) r=l.l = l 
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and 



24=^o+d-i=2 + l=3, 



as before. By the corollary following theorem 2, 
d{V 2 \f(A)]}-d{V 1 [A]}=d Q +d„ 1 +p +p- 1 +n +n„ 1 

= 2 + 1+1 + 1 + 1+0 = 6, 
which checks with the observed results above. 



From a glance at the above eigenvectors generating 
V[J], it is clear that V[A] is generated by the first, 
second, fifth, and sixth-column vectors of P. Since 
j{A) is a scalar, V\J(A)] is generated by all seven 
column vectors of P. 



The author thanks A. H. Clifford, O. Taussky, 
and N. Wiegmann for their helpful suggestions. 

Washington, December 1, 1952. 
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